Text-conditioned Transformer for automatic pronunciation error detection
نویسندگان
چکیده
Automatic pronunciation error detection (APED) plays an important role in the domain of language learning. As for previous ASR-based APED methods, decoded results need to be aligned with target text so that errors can found out. However, since decoding process and alignment are independent, prior knowledge about is not fully utilized. In this paper, we propose use as extra condition Transformer backbone handle task. The proposed method output states consideration relationship between input speech a end-to-end fashion. Meanwhile, used decoder input, works feed-forward manner instead autoregressive inference stage, which significantly boost speed actual deployment. We set baseline model conduct several experiments on L2-Arctic dataset. demonstrate our approach obtain 8.4% relative improvement F1 score metric.
منابع مشابه
Comparing different approaches for automatic pronunciation error detection
One of the biggest challenges in designing computer assisted language learning (CALL) applications that provide automatic feedback on pronunciation errors consists in reliably detecting the pronunciation errors at such a detailed level that the information provided can be useful to learners. In our research we investigate pronunciation errors frequently made by foreigners learning Dutch as a se...
متن کاملAutomatic pronunciation error detection and guidance for foreign language learning
We propose an e ective application of speech recognition to foreign language pronunciation learning. The objective of our system is to detect pronunciation errors and provide diagnostic feedback through speech processing and recognition methods. Automatic pronunciation error detection is used for two kinds of mispronunciation, that is mistake and linguistical inheritance. The correlation betwee...
متن کاملAutomatic Pronunciation Error Detection Based on Extended Pronunciation Space Using the Unsupervised Clustering of Pronunciation Errors
Calculating posterior probability within a standard pronunciation space (SPS) is a common method in automatic pronunciation error detection (APED). However, to pronunciation errors outside the SPS, this kind of methods can only give an approximate solution, that may be not right in many applications. This paper expands the SPS to include more pronunciation errors, introduces a Bhattacharyya dis...
متن کاملAutomatic pronunciation error detection: an acoustic-phonetic approach
In this paper, we present an acoustic-phonetic approach to automatic pronunciation error detection. Classifiers using techniques such as Linear Discriminant Analysis or a decision tree were developed for three sounds that are frequently pronounced incorrectly by L2-learners of Dutch: /A/, /Y/ and /x/. The acoustic properties of these pronunciation errors were examined so as to define a number o...
متن کاملAutomatic Error Recovery for Pronunciation Dictionaries
In this paper, we present our latest investigations on pronunciation modeling and its impact on ASR. We propose completely automatic methods to detect, remove, and substitute inconsistent or flawed entries in pronunciation dictionaries. The experiments were conducted on different tasks, namely (1) word-pronunciation pairs from the Czech, English, French, German, Polish, and Spanish Wiktionary [...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Speech Communication
سال: 2021
ISSN: ['1872-7182', '0167-6393']
DOI: https://doi.org/10.1016/j.specom.2021.04.004